Faking Errors to Avoid Making Errors: Machine Learning for Error Detection in Writing

نویسنده

  • Jonas SJÖBERGH
چکیده

This paper describes a method to detect errors in written text which requires no manual work. The method used is to simply annotate a lot of errors in written text and train an off-theshelf machine learning implementation to recognize such errors. To avoid manual annotation synthetically created errors are used for training. The method is evaluated on erroneously split compounds and word order errors. Results are comparable to a state of the art grammar checker based on manually created rules. The evaluation is performed on real (not synthetic) errors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Error Taxonomy of TOEFL iBT Writing: An Iranian Perspective

TOEFL iBT has turned recently heads to the impacts language tests can have on language learning. Since error analysis-based instruction has gained a new life with the advent of the computer analysis of the learner’s language, the researchers of this study embarked on examining a sample of integrated and independent writing tasks of 45 Iranian TOEFL iBT candidates in order to identify and classi...

متن کامل

Iranian EFL Learners' Written Grammatical Errors: Different Levels of Language Proficiency

Errors are one of the enigmatic parts in the process of foreign language (L2) learning as they are extremely versatile at each and every stage of the language learning proficiency. The present study, therefore, was an attempt to reveal Iranian EFL learners’ grammatical errors in writing at two levels of proficiency, namely lower intermediate and advanced, and then to investigate whether there w...

متن کامل

Outlier Detection Using Extreme Learning Machines Based on Quantum Fuzzy C-Means

One of the most important concerns of a data miner is always to have accurate and error-free data. Data that does not contain human errors and whose records are full and contain correct data. In this paper, a new learning model based on an extreme learning machine neural network is proposed for outlier detection. The function of neural networks depends on various parameters such as the structur...

متن کامل

Grammatical Error Correction of English as Foreign Language Learners

This study aimed to discover the insight of error correction by implementing two correction systems on three Iranian university students. The three students were invited to write four in-class essays throughout the semester, in which their verb errors and individual-selected errors were corrected using the Code Correction System and the Individual Correction System. At the end of the study, the...

متن کامل

Human errors identification in operation of meat grinder using TAFEI technique

  Background: Human error is the most important cause of occupational and non-occupational accidents. Because, it seems necessary to identify, predict and analyze human errors, and also offer appropriate control strategies to reduce errors which cause adverse consequences, the present study was carried out with the aim of identifying human errors while operating meat grinder and offer sugg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004